AITopics | Greenbelt

We report the discovery that binary encoding allows neural networks to extrapolate periodic functions beyond their training bounds. We introduce Normalized Base-2 Encoding (NB2E) as a method for encoding continuous numerical values and demonstrate that, using this input encoding, vanilla multi-layer perceptrons (MLP) successfully extrapolate diverse periodic signals without prior knowledge of their functional form. Internal activation analysis reveals that NB2E induces bit-phase representations, enabling MLPs to learn and extrapolate signal structure independently of position.

activation, extrapolation, representation, (15 more...)

arXiv.org Machine Learning

2512.10817

Country:

North America > United States > Maryland > Prince George's County > College Park (0.14)
North America > United States > Maryland > Baltimore County (0.14)
North America > United States > Maryland > Baltimore (0.14)
(3 more...)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

SolarCube: An Integrative Benchmark Dataset Harnessing Satellite and In-situ Observations for Large-scale Solar Energy Forecasting

Neural Information Processing SystemsOct-9-2025, 17:43:04 GMT

However, the cloud induced-variability of solar radiation reaching the earth's surface presents a challenge for integrating solar power into the grid (e.g., storage and backup management).

forecasting, radiation, solar radiation, (15 more...)

Neural Information Processing Systems

Country:

South America (0.04)
Oceania (0.04)
North America > United States > New York > Broome County > Binghamton (0.04)
(5 more...)

Genre: Research Report > New Finding (0.46)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Energy > Renewable > Solar (1.00)
Energy > Power Industry (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Data Science (0.68)
Information Technology > Artificial Intelligence > Vision (0.68)

Add feedback

Chasing the Timber Trail: Machine Learning to Reveal Harvest Location Misrepresentation

Sarkar, Shailik, Yousuf, Raquib Bin, Wang, Linhan, Mayer, Brian, Mortier, Thomas, Deklerck, Victor, Truszkowski, Jakub, Simeone, John C., Norman, Marigold, Saunders, Jade, Lu, Chang-Tien, Ramakrishnan, Naren

arXiv.org Artificial IntelligenceMar-16-2025

Illegal logging poses a significant threat to global biodiversity, climate stability, and depresses international prices for legal wood harvesting and responsible forest products trade, affecting livelihoods and communities across the globe. Stable isotope ratio analysis (SIRA) is rapidly becoming an important tool for determining the harvest location of traded, organic, products. The spatial pattern in stable isotope ratio values depends on factors such as atmospheric and environmental conditions and can thus be used for geographic origin identification. We present here the results of a deployed machine learning pipeline where we leverage both isotope values and atmospheric variables to determine timber harvest location. Additionally, the pipeline incorporates uncertainty estimation to facilitate the interpretation of harvest location determination for analysts. We present our experiments on a collection of oak (Quercus spp.) tree samples from its global range. Our pipeline outperforms comparable state-of-the-art models determining geographic harvest origin of commercially traded wood products, and has been used by European enforcement agencies to identify harvest location misrepresentation. We also identify opportunities for further advancement of our framework and how it can be generalized to help identify the origin of falsely labeled organic products throughout the supply chain.

artificial intelligence, atmospheric variable, machine learning, (13 more...)

arXiv.org Artificial Intelligence

2502.14115

Country:

Europe > Russia (0.05)
Asia > Russia (0.05)
North America > United States > Virginia > Alexandria County > Alexandria (0.05)
(10 more...)

Genre: Research Report (1.00)

Industry:

Materials > Paper & Forest Products (1.00)
Law (1.00)
Government > Regional Government > North America Government > United States Government (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Exoplanet Transit Candidate Identification in TESS Full-Frame Images via a Transformer-Based Algorithm

Salinas, Helem, Brahm, Rafael, Olmschenk, Greg, Barry, Richard K., Pichara, Karim, Silva, Stela Ishitani, Araujo, Vladimir

arXiv.org Artificial IntelligenceFeb-11-2025

The Transiting Exoplanet Survey Satellite (TESS) is surveying a large fraction of the sky, generating a vast database of photometric time series data that requires thorough analysis to identify exoplanetary transit signals. Automated learning approaches have been successfully applied to identify transit signals. However, most existing methods focus on the classification and validation of candidates, while few efforts have explored new techniques for the search of candidates. To search for new exoplanet transit candidates, we propose an approach to identify exoplanet transit signals without the need for phase folding or assuming periodicity in the transit signals, such as those observed in multi-transit light curves. To achieve this, we implement a new neural network inspired by Transformers to directly process Full Frame Image (FFI) light curves to detect exoplanet transits. Transformers, originally developed for natural language processing, have recently demonstrated significant success in capturing long-range dependencies compared to previous approaches focused on sequential data. This ability allows us to employ multi-head self-attention to identify exoplanet transit signals directly from the complete light curves, combined with background and centroid time series, without requiring prior transit parameters. The network is trained to learn characteristics of the transit signal, like the dip shape, which helps distinguish planetary transits from other variability sources. Our model successfully identified 214 new planetary system candidates, including 122 multi-transit light curves, 88 single-transit and 4 multi-planet systems from TESS sectors 1-26 with a radius > 0.27 $R_{\mathrm{Jupiter}}$, demonstrating its ability to detect transits regardless of their periodicity.

light curve, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2502.07542

Country:

North America > United States > Maryland > Prince George's County > College Park (0.14)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Europe > Belgium > Flanders > Flemish Brabant > Leuven (0.04)
(4 more...)

Genre: Research Report > New Finding (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.82)

Add feedback

Scalable Higher Resolution Polar Sea Ice Classification and Freeboard Calculation from ICESat-2 ATL03 Data

Iqrah, Jurdana Masuma, Koo, Younghyun, Wang, Wei, Xie, Hongjie, Prasad, Sushil K.

arXiv.org Artificial IntelligenceFeb-4-2025

ICESat-2 (IS2) by NASA is an Earth-observing satellite that measures high-resolution surface elevation. The IS2's ATL07 and ATL10 sea ice elevation and freeboard products of 10m-200m segments which aggregated 150 signal photons from the raw ATL03 (geolocated photon) data. These aggregated products can potentially overestimate local sea surface height, thus underestimating the calculations of freeboard (sea ice height above sea surface). To achieve a higher resolution of sea surface height and freeboard information, in this work we utilize a 2m window to resample the ATL03 data. Then, we classify these 2m segments into thick sea ice, thin ice, and open water using deep learning methods (Long short-term memory and Multi-layer perceptron models). To obtain labeled training data for our deep learning models, we use segmented Sentinel-2 (S2) multi-spectral imagery overlapping with IS2 tracks in space and time to auto-label IS2 data, followed by some manual corrections in the regions of transition between different ice/water types or cloudy regions. We employ a parallel workflow for this auto-labeling using PySpark to scale, and we achieve 9-fold data loading and 16.25-fold map-reduce speedup. To train our models, we employ a Horovod-based distributed deep-learning workflow on a DGX A100 8 GPU cluster, achieving a 7.25-fold speedup. Next, we calculate the local sea surface heights based on the open water segments. Finally, we scale the freeboard calculation using the derived local sea level and achieve 8.54-fold data loading and 15.7-fold map-reduce speedup. Compared with the ATL07 (local sea level) and ATL10 (freeboard) data products, our results show higher resolutions and accuracy (96.56%).

artificial intelligence, classification, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2502.027

Country:

North America > United States > Colorado > Boulder County > Boulder (0.14)
Southern Ocean > Ross Sea (0.06)
Antarctica (0.04)
(4 more...)

Genre:

Workflow (1.00)
Research Report > New Finding (0.86)

Industry:

Information Technology (0.46)
Government > Space Agency (0.35)
Government > Regional Government > North America Government > United States Government (0.35)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

WxC-Bench: A Novel Dataset for Weather and Climate Downstream Tasks

Shinde, Rajat, Phillips, Christopher E., Ankur, Kumar, Gupta, Aman, Pfreundschuh, Simon, Roy, Sujit, Kirkland, Sheyenne, Gaur, Vishal, Lin, Amy, Sheshadri, Aditi, Nair, Udaysankar, Maskey, Manil, Ramachandran, Rahul

arXiv.org Artificial IntelligenceDec-3-2024

High-quality machine learning (ML)-ready datasets play a foundational role in developing new artificial intelligence (AI) models or fine-tuning existing models for scientific applications such as weather and climate analysis. Unfortunately, despite the growing development of new deep learning models for weather and climate, there is a scarcity of curated, pre-processed machine learning (ML)-ready datasets. Curating such high-quality datasets for developing new models is challenging particularly because the modality of the input data varies significantly for different downstream tasks addressing different atmospheric scales (spatial and temporal). Here we introduce WxC-Bench (Weather and Climate Bench), a multi-modal dataset designed to support the development of generalizable AI models for downstream use-cases in weather and climate research. WxC-Bench is designed as a dataset of datasets for developing ML-models for a complex weather and climate system, addressing selected downstream tasks as machine learning phenomenon. WxC-Bench encompasses several atmospheric processes from meso-$\beta$ (20 - 200 km) scale to synoptic scales (2500 km), such as aviation turbulence, hurricane intensity and track monitoring, weather analog search, gravity wave parameterization, and natural language report generation. We provide a comprehensive description of the dataset and also present a technical validation for baseline analysis. The dataset and code to prepare the ML-ready data have been made publicly available on Hugging Face -- https://huggingface.co/datasets/nasa-impact/WxC-Bench

artificial intelligence, deep learning, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2412.0278

Country:

Atlantic Ocean (0.04)
Pacific Ocean (0.04)
North America > United States > Alabama > Madison County > Huntsville (0.04)
(13 more...)

Genre: Research Report (1.00)

Industry:

Transportation > Air (1.00)
Energy (1.00)
Government > Regional Government > North America Government > United States Government (0.88)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Short-Period Variables in TESS Full-Frame Image Light Curves Identified via Convolutional Neural Networks

Olmschenk, Greg, Barry, Richard K., Silva, Stela Ishitani, Powell, Brian P., Kruse, Ethan, Schnittman, Jeremy D., Cieplak, Agnieszka M., Barclay, Thomas, Solanki, Siddhant, Ortega, Bianca, Baker, John, Mamani, Yesenia Helem Salinas

arXiv.org Artificial IntelligenceOct-2-2024

The Transiting Exoplanet Survey Satellite (TESS) mission measured light from stars in ~85% of the sky throughout its two-year primary mission, resulting in millions of TESS 30-minute cadence light curves to analyze in the search for transiting exoplanets. To search this vast dataset, we aim to provide an approach that is both computationally efficient, produces highly performant predictions, and minimizes the required human search effort. We present a convolutional neural network that we train to identify short period variables. To make a prediction for a given light curve, our network requires no prior target parameters identified using other methods. Our network performs inference on a TESS 30-minute cadence light curve in ~5ms on a single GPU, enabling large scale archival searches. We present a collection of 14156 short-period variables identified by our network. The majority of our identified variables fall into two prominent populations, one of short-period main sequence binaries and another of Delta Scuti stars. Our neural network model and related code is additionally provided as open-source code for public use and extension.

light curve, partition, short-period variable, (17 more...)

arXiv.org Artificial Intelligence

doi: 10.3847/1538-3881/ad55f1

2402.12369

Country:

North America > United States > Maryland > Prince George's County > College Park (0.14)
North America > United States > California > Alameda County > Berkeley (0.14)
South America > Chile (0.04)
(2 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.47)

Add feedback

The Palomar twilight survey of 'Ayl\'o'chaxnim, Atiras, and comets

Bolin, B. T., Masci, F. J., Coughlin, M. W., Duev, D. A., Ivezić, Ž., Jones, R. L., Yoachim, P., Ahumada, T., Bhalerao, V., Choudhary, H., Contreras, C., Cheng, Y. -C., Copperwheat, C. M., Deshmukh, K., Fremling, C., Granvik, M., Hardegree-Ullman, K. K., Ho, A. Y. Q., Jedicke, R., Kasliwal, M., Kumar, H., Lin, Z. -Y., Mahabal, A., Monson, A., Neill, J. D., Nesvorný, D., Perley, D. A., Purdum, J. N., Quimby, R., Serabyn, E., Sharma, K., Swain, V.

arXiv.org Artificial IntelligenceSep-23-2024

Near-sun sky twilight observations allow for the detection of asteroid interior to the orbit of Venus (Aylos), the Earth (Atiras), and comets. We present the results of observations with the Palomar 48-inch telescope (P48)/Zwicky Transient Facility (ZTF) camera in 30 s r-band exposures taken during evening astronomical twilight from 2019 Sep 20 to 2022 March 7 and during morning astronomical twilight sky from 2019 Sep 21 to 2022 Sep 29. More than 46,000 exposures were taken in evening and morning astronomical twilight within 31 to 66 degrees from the Sun with an r-band limiting magnitude between 18.1 and 20.9. The twilight pointings show a slight seasonal dependence in limiting magnitude and ability to point closer towards the Sun, with limiting magnitude slightly improving during summer. In total, the one Aylo, (594913) 'Ayl\'o'chaxnim, and 4 Atiras, 2020 OV1, 2021 BS1, 2021 PB2, and 2021 VR3, were discovered in evening and morning twilight observations. Additional twilight survey discoveries also include 6 long-period comets: C/2020 T2, C/2020 V2, C/2021 D2, C/2021 E3, C/2022 E3, and C/2022 P3, and two short-period comets: P/2021 N1 and P/2022 P2 using deep learning comet detection pipelines. The P48/ZTF twilight survey also recovered 11 known Atiras, one Aylo, three short-period comes, two long-period comets, and one interstellar object. Lastly, the Vera Rubin Observatory will conduct a twilight survey starting in its first year of operations and will cover the sky within 45 degrees of the Sun. Twilight surveys such as those by ZTF and future surveys will provide opportunities for discovering asteroids inside the orbits of Earth and Venus.

angular distance, recovery, survey field, (13 more...)

arXiv.org Artificial Intelligence

2409.15263

Country:

North America > United States > Arizona > Pima County > Tucson (0.14)
North America > United States > Washington > King County > Seattle (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
(24 more...)

Genre: Research Report (0.82)

Industry: Energy (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.34)

Add feedback

Neural Surrogate HMC: Accelerated Hamiltonian Monte Carlo with a Neural Network Surrogate Likelihood

Wolniewicz, Linnea M, Sadowski, Peter, Corti, Claudio

arXiv.org Artificial IntelligenceJul-29-2024

Bayesian Inference with Markov Chain Monte Carlo requires efficient computation of the likelihood function. In some scientific applications, the likelihood must be computed by numerically solving a partial differential equation, which can be prohibitively expensive. We demonstrate that some such problems can be made tractable by amortizing the computation with a surrogate likelihood function implemented by a neural network. We show that this has two additional benefits: reducing noise in the likelihood evaluations and providing fast gradient calculations. In experiments, the approach is applied to a model of heliospheric transport of galactic cosmic rays, where it enables efficient sampling from the posterior of latent parameters in the Parker equation.

flux, likelihood, likelihood function, (14 more...)

arXiv.org Artificial Intelligence

2407.20432

Country:

North America > United States > Hawaii > Honolulu County > Honolulu (0.05)
North America > United States > Maryland > Prince George's County > Greenbelt (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)

Genre: Research Report (0.64)

Industry:

Government > Regional Government > North America Government > United States Government (0.48)
Government > Space Agency (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.91)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.36)

Add feedback